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We study statistical inferences for a class of modulated stationary processes with time-dependent 
variances. Due to non-stationarity and the large number of unknown parameters, existing 
methods for stationary, or locally stationary, time series are not applicable. Based on a self- 
normalization technique, we address several inference problems, including a self-normalized cen- 
tral limit theorem, a self-normalized cumulative sum test for the change-point problem, a long- 
run variance estimation through blockwise self-normalization, and a self-normalization-based 
wild bootstrap. Monte Carlo simulation studies show that the proposed self-normalization-based 
methods outperform stationarity-based alternatives. We demonstrate the proposed methodology 
using two real data sets: annual mean precipitation rates in Seoul from 1771-2000, and quarterly 
U.S. Gross National Product growth rates from 1947-2002. 

Keywords: change-point analysis; confidence interval; long-run variance; modulated stationary 
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1. Introduction 

In time series analysis, stationarity requires that dependence structure be sustained over 
time, and thus we can borrow information from one time period to study model dy- 
namics over another period; see Fan and Yao [20] for nonparametric treatments and 
Lahiri [29] for various resampling and block bootstrap methods. In practice, however, 
many climatic, economic and financial time series are non-stationary and therefore chal- 
lenging to analyze. First, since dependence structure varies over time, information is 
more localized. Second, non-stationary processes often require extra parameters to ac- 
count for time- varying structure. One way to overcome these issues is to impose certain 
local stationarity; see, for example, Dahlhaus [15] and Adak [1] for spectral representation 
frameworks and Dahlhaus and Polonik [16] for a time domain approach. 

In this article we study a class of modulated stationary processes (see Adak [1]) 

X,^fi + ate,, i = l,...,n, (1.1) 

where are stationary time series with zero mean, and cr^ > are unknown constants 
adjusting for time-dependent variances. Then Xi oscillates around the constant mean /x, 
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whereas its variance changes over time in an unknown manner. In the special case of 
ai = 1, (1.1) reduces to stationary case. If ai = s{i/n) for a Lipschitz continuous func- 
tion s{t) on [0,1], then (1.1) is locally stationary. For the general non-stationary case 
(1.1), the number of unknown parameters is larger than the number of observations, 
and it is infeasible to estimate ai. Due to non-stationarity and the large number of 
unknown parameters, existing methods that are developed for (locally) stationary pro- 
cesses are not applicable, and our main purpose is to develop new statistical inference 
techniques. 

First, we establish a uniform strong approximation result which can be used to de- 
rive a self-normalized central limit theorem (CLT) for the sample mean X of (1.1). For 
stationary case <7i = l, by Fan and Yao [20], under mild mixing conditions, 

oo 

V^{X - H)^ N{0,T^), whereT2=7o-F2^7fe and7fe = Cov(e^,ei+fe). (1.2) 

fe=i 

For the modulated stationary case (1.1), it is non-trivial whether y/n{X — /i) has a 
CLT without imposing further assumptions on ai and the dependence structure of e^. 
Moreover, even when the latter CLT exists, it is difficult to estimate the limiting variance 
due to the large number of unknown parameters; see De Jong and Davidson [18] for 
related work assuming a near-epoch dependent mixing framework. Zhao [41] studied 
confidence interval construction for fi in (1.1) by assuming a block- wise asymptotically 
equal cumulative variance assumption. The latter assumption is rather restrictive and 
essentially requires that block averages be asymptotically independent and identically 
distributed (i.i.d.). In this article, we deal with the more general setting (1.1). Under a 
strong invariance principle assumption, we establish a self-normalized CLT with the self- 
normalizing constant adjusting for time-dependent non-stationarity. The obtained CLT 
is an extension of the classical CLT for i.i.d. data or stationary time series to modulated 
stationary processes. Furthermore, we extend the idea to linear combinations of means 
over different time periods, which allows us to address inference regarding mean levels 
over multiple time periods. 

Second, we study the wild bootstrap for modulated stationary processes. Since the sem- 
inal work of Efron [19], a great deal of research has been done on the bootstrap under 
various settings, ranging from bootstrapping for i.i.d. data in Efron [19], wild bootstrap- 
ping for independent observations with possibly non-constant variances in Wu [39] and 
Liu [30] , to various block bootstrapping and resampling methods for stationary time series 
in Kiinsch [27], Politis and Romano [34], Biihlmann [12] and the monograph Lahiri [29]. 
With the established self-normalized CLT, we propose a wild bootstrap procedure that is 
tailored to deal with modulated stationary processes: the dependence is removed through 
a scaling factor, and the non-constant variance structure of the original data is preserved 
in the wild bootstrap data-generating mechanism. Our simulation study shows that the 
wild bootstrap method outperforms the widely used stationarity-based block bootstrap. 

Third, we address change-point analysis. The change-point problem has been an active 
area of research; see Pcttitt [32] for proportion changes in binary data, Horvath [25] for 
mean and variance changes in Gaussian observations, Bai and Perron [8] for coefficient 
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changes in linear models, Aue et al. [6] for coefficient changes in polynomial regression 
with uncorrelated errors, Aue et al. [7] for mean change in time series with stationary er- 
rors, Shao and Zhang [37] for change-points for stationary time series and the monograph 
by Csorgo and Horvath [14] for more discussion. Most of these works deal with station- 
ary and/or independent data. Hansen [24] studied tests for constancy of parameters in 
linear regression models with non-stationary regressors and conditionally homoscedastic 
martingale difference errors. Here we consider 

Hq: Xi = Hi + aiei,ni^--- = Hn, Ha- Hi = ■ ■ ■ = ^i.J ^ ^i.J+l = ■ ■ ■ = ^J.n, (1-3) 

where J is an unknown change point. The aforementioned works mainly focused on 
detecting changes in mean while the error variance is constant. On the other hand, 
researchers have also realized the importance of the variance/covariance structure in 
change point analysis. For example, Inclan and Tiao [26] studied change in variance for 
independent data, and Aue et al. [5] and Berkes, Gombay and Horvath [10] considered 
change in covariance for time series data. To our knowledge, there has been almost no 
attempt to advance change point analysis under the non-constant variances framework in 
(1.3). Andrews [4] studied change point problem under near-epoch dependence structure 
that allows for non-stationary processes, but his Assumption 1(c) on page 830 therein 
essentially implies that the process has constant variance. The popular cumulative sum 
(CUSUM) test is developed for stationary time series and does not take into account the 
time-dependent variances. Using the self-normalization idea, we propose a self-normalized 
CUSUM test and a wild bootstrap method to obtain its critical value. Our empirical 
studies show that the usual CUSUM tests tend to over-reject the null hypothesis in the 
presence of non-constant variances. By contrast, the self-normalized CUSUM test yields 
size close to the nominal level. 

Fourth, we estimate the long-run variance in (1.2). Long-run variance plays an es- 
sential role in statistical inferences involving time scries. Most works in the literature deal 
with stationary processes through various block bootstrap and subsampling approaches; 
see Carlstein [13], Kiinsch [27], Politis and Romano [34], Gotze and Kiinsch [21] and the 
monograph Lahiri [29]. De Jong and Davidson [18] established the consistency of ker- 
nel estimators of covariance matrices under a near epoch dependent mixing condition. 
Recently, Miiller [31] studied robust long-run variance estimation for locally stationary 
process. For model (1.1), the error process {ci} is contaminated with unknown standard 
deviations {o"i}, and we apply blockwise self-normalization to remove non-stationarity, 
resulting in asymptotically stationary blocks. 

Fifth, the proposed methods can be extended to deal with the linear regression model 

X, = U,(3 + a^e,, (1.4) 

where Ui = (wj^i, . . . ,Ui_p) are deterministic covariates, and /? = (/3i, . . . ,/?p)' is the un- 
known column vector of parameters. For p = 2, Hansen [23] established the asymptotic 
normality of the least-squares estimate of the slope parameter under a fairly general 
framework of non-stationary errors. While Hansen [23] assumed that the errors form a 
martingale difference array so that they are uncorrelated, the framework in (1.4) is more 
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general in that it allows for correlations. On the other hand, Hansen [23] allowed the 
conditional volatilities to follow an autoregressive model, hence introducing stochastic 
volatilities. Phillips, Sun and Jin [33] considered (1.4) for stationary errors, and their 
approach is not applicable here due to the unknown non-constant variances af . In Sec- 
tion 2.6 we consider self-normalized CLT for the least-squares estimator of /3 in (1.4). 
In the polynomial regression case u^.r = {i/nY^-^, Auc et al. [6] studied a likelihood- 
based test for constancy of f3 in (1.4) for uncorrelated errors with constant variance. 
Due to the presence of correlation and time- varying variances, it is more challenging 
to study the change point problem for (1.4) and this is beyond the scope of this arti- 
cle. 

The rest of this article is organized as follows. We present theoretical results in Sec- 
tion 2. Sections 3-4 contain Monte Carlo studies and applications to two real data sets. 

2. Main results 

For sequences {a„} and {bn}, write a„ = 0(6„), a„ = o(6„) and a„ x 6„, respectively, if 
|an/&n| < ci, ttn/bn — >■ and C2 < \an/bn\ < C3, for some constants < ci, 02,03 < 00. For 
q> and a random variable e, write e S £^ if \\e.\\q '■= {-E([e[^)}^/'? < 00. 

2.1. Uniform approximations for modulated stationary processes 

In (1.1), assume without loss of generality that E{ei) = and E{ef) = 1 so that {ci} and 
{e| — 1} are centered stationary processes. With the convention Sq ^ Sq ^ 0, define 

i i 

S,=Y.e, and 5*-^(e^2-l), z = l,2,.... (2.1) 

Assumption 2.1. There exist standard Brownian motions {Bt} and {B^} such that 
max [5*4 - tB,;[ =Oa.s.(^«) and max [S* - t*B*[ = Oa.s.(A„), (2.2) 

l<i<n l<i<n 

where A„ is the approximation error, and r*^ are the long-run variances o/{e,;} and 
{e| — 1}, respectively. Further assume > to avoid the degenerate case = 0. 

The uniform approximations in (2.2) are generally called strong invariance principle. 
The two Brownian motions {Bt} and {B^} may be defined on different probability spaces 
and hence are not jointly distributed, which is not an issue because our argument does 
not depend on their joint distribution. To see how to use (2.2), under Hq in (1.3), consider 

j j 
Fj^.iiXj- IJ-) and V^j^^iX.-XjY, where = j-^ ^ X,. (2.3) 

i=l 1=1 
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Theorem 2.1 below presents uniform approximations for Fj and V^. Define 

n n 

r„ = |a„| + ^|fT,-(T,_i| and < = | + ^ [a^ - a^, J, (2.4) 

i=2 1=2 

i=i \i=i / 



Theorem 2.1. Lef f^.^j /loW. -For any cG (0,1], the following uniform approximations 
hold: 



max 

cn<j<n 



Oa.s.(?'riA„), 



max IZ' - S^l = Op{(r2 A2 + S^^J/n + + ^* 



cn<j<n 



(2.6) 
(2.7) 



Theorem 2.1 provides quite general results under (2.2). We now discuss sufficient con- 
ditions for (2.2). Shao [36] obtained sufficient mixing conditions for (2.2). In this article, 
we briefly introduce the framework in Wu [40] . Assume that has the causal representa- 
tion Ci = G{. . . ,£i_i,ei), where are i.i.d. innovations, and G is a measurable function 
such that Ci is well defined. Let {e[i\i!r--^ be an independent copy of {ei}i^z- Assume 

oo 

^i||ei - e-lls < oo, where e- = G'(. .., e_i, Eg, £i, ei_i, e^). (2.8) 
1=1 

Proposition 2.1 below follows from Corollary 4 in Wu [40]. 

Proposition 2.1. Assume that (2.8) holds. Then (2.2) holds with A„ = ri^/"* log(n) , the 
optimal rate up to a logarithm factor. 

For linear process = X^jlo'^i^^-i ''^ith € and E{ei) = 0, \\ei — e'Jjs = ||£o ~ 
eollsjail- If YlTLi < then (2.2) holds with A„ = n^/'^\og{n). For many nonlinear 
time series, He; — e'^\\% decays exponentially fast and hence (2.8) holds; see Section 3.1 of 
Wu [40]. From now on we assume (2.2) holds with A„ =ni/^log(?7.). 

Remark 2.1. If arc i.i.d. with E{ei) = and G C for some 2 < g < 4, the celebrated 
"Hungarian embedding" asserts that X]j=i satisfies a strong invariance principle with 
the optimal rate Oa_,s.{n^^'^)- Thus, it is necessary to have the moment assumption G 
in Proposition 2.1 in order to ensure strong invariance principles for both Si and S* in 
(2.1) with approximation rate n^/'^\og{n). On the other hand, one can relax the moment 
assumption by loosening the approximation rate. For example, by Corollary 4 in Wu [40], 
assume G C^'^ for some q> 2 and Yl^i ~ ^V\2q < co, then (2.2) holds with A„ = 

^l/min(g,4) log(n). 

As shown in Examples 2.1-2.3 below, r„ and r* in (2.4) often have tractable bounds. 
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Example 2.1. If a; is non-decreasing in z, then (T„ < r„ < 2cr„ and cr^ < < 2ct^. If 
(Ti is non-increasing in i, then r„ = cti and r* = cr^. If cTi are piecewise constants with 
finitely many pieces, then r„,r* = 0(1). 

Example 2.2. Let ai = s{i/rC) for 7 e [0, 1] and a Lipschitz continuous function s{t),t S 
[0, CJo), sup(g[Q s{t) < 00. Then r„, r* = 0(n-'^~'''). If 7 = 1, we obtain a locahy stationary 
case with the time window i/n£ [0, 1] ; if 7 € [0, 1) , we have the infinite time window [0, 00) 
as n/nH ^ 00, which may be more reasonable for data with a long time horizon. 

Example 2.3. If ai = i^L{i) for a slowly varying function L{-) such that L{cx) / L{x) — 1 
as a; ^ 00 for all c > 0. Then we can show r„ = 0{n^ L{n)} or 0(1) and r* = 0{ii?^ [n)} 
or 0(1), depending on whether /3 > or /3 < 0. For the boundary case /? = 0, assume 
L(i + l)/i(i) = l + 0(l/i) uniformly, then r„ = L(n) 0(1) ^"^3 ^(i)A = 0{log(n) x 
maxi<i<„i(i)}. Similarly, r* = 0{log(ri,) maxi<j<„ ^^(i)}. 

2.2. Self-normalized central limit theorem 

In this section we establish a self- normalized CLT for the sample average X. To under- 
stand how non-stationarity makes this problem difficult, elementary calculation shows 

" 2 

Var{^^(X-/.)} = ^^a2 + - ^ a,a,7,_, r^, (2.9) 

i—l l<i<j<n 

where 7^ = Cov(eo,efc). In the stationary case ai = 1, under condition X^fcLo \"fk\ < 00, 
— ^ T^, the long-run variance in (1.2). For non-constant variances, it is difficult to 
deal with directly, due to the large number of unknown parameters and complicated 
structure. See Dc Jong and Davidson [18] for a kernel estimator of under a near-epoch 
dependent mixing framework. 

To attenuate the aforementioned issue, we apply the uniform approximations in Theo- 
rem 2.1. Assume that (2.10) below holds. Note that the increments Bi — Bi-i of standard 
Brownian motions are i.i.d. standard normal random variables. By (2.6), n{X — fi) is 
equivalent to A^(0,r^E^) in distribution. By (2.7), li„/S„ ^ 1 in probability. By Slut- 
sky's theorem, we have Proposition 2.2. 

Proposition 2.2. Let (2.2) hold with A„ ni/4log(ri). For r„,r;,S2,E*2 in (24)- 
(2.5), assume 

S„ = r„A„/I]„ + (r:A„ + ^ 0. (2.10) 

Recall in (2.3). Then as n— >-oo, n(X — /i)/V_,j => iV(0, r^) . Consequently, a [1 — a) 
asymptotic confidence interval for fi is X ± z^/2Ty_^-^/n, where f is a consistent estimate 
of T (Section 2.5 below), and Za/2 is (1 — a/2) standard normal quantile. 

Proposition 2.2 is an extension of the classical CLT for i.i.d. data or stationary pro- 
cesses to modulated stationary processes, li Xi are i.i.d., then n{X — y) /V_^ ^ N{Q, 1). In 
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Proposition 2.2, can be viewed as the variance inflation factor due to the dependence 
of {ei}. For stationary data, the sample variance V_^/n is a consistent estimate of the 
population variance. Here, for non-constant variances case (1.1), by (2.7) in Theorem 2.1, 
y^/n can be viewed as an estimate of the time-average "population variance" S^/n. So, 
we can interpret the CLT in Proposition 2.2 as a self- normalized CLT for modulated sta- 
tionary processes with the self- normalizing term adjusting for non-stationarity due 
to ui, . . . , (Tn and r^, accounting for dependence of {e^}. Clearly, parameters cti, . . . , cr„ 
are canceled out through self- normalization. Finally, condition (2.10) is satisfied in Ex- 
ample 2.2 with 7 > 3/4 and Example 2.3 with (3 > -1/4. 

In classical statistics, the width of confidence intervals usually shrinks as sample size 
increases. By Proposition 2.2 and Theorem 2.1, the width of the constructed confidence 
interval for fi is proportional to y_^/n or. equivalently, T,n/n. Thus, a necessary and 
sufficient condition for shrinking confidence interval is crf/n^ — 0, which is satisfied 

if (Ti = o{y/i). An intuitive explanation is as follows. For i.i.d. data, sample mean converges 
at a rate of 0{y/n). In (1.1), if di grows faster than 0{y/i), the contribution of a new 
observation is negligible relative to its noise level. 

Example 2.4- If x with /? e [0, 1/2), the length of confidence interval is propor- 
tional to S,i/n X n^~^/^. In particular, if ci < Ci < C2 for some positive constants ci and 
C2, then E„/n achieves the optimal rate 0{n~^^^). li ai x log(i), then I]„/n x log(n)/-y/n. 

The same idea can be extended to linear combinations of means over multiple time peri- 
ods. Suppose we have observations from k consecutive time periods Ti, . . . , Tfc, each of the 
form (1.1) with different means, denoted by yUi, . . . ,^k, and each having time-dependent 
variances. Let u = /3i^i + • • • + /3fc/ifc for given coefficients /3i, . . . , /?fc. For example, if we 
are interested in mean change from 7i to 72, we can take u = — fJ-i] if we are inter- 
ested in whether the increase from 75 to Ti is larger than that from 7i to 72, we can 
let V = (/i4 — /is) — (/i2 ^ Ml)- Proposition 2.3 below extends Proposition 2.2 to multiple 
means. 



Proposition 2.3. Let v = /3i/xi -I- • • • -I- l^kfJ-k- For Tj , denote its sample size Uj and its 
sample average X(j). Assume that (2.10) holds for each individual time period Tj and, 
for simplicity, that ui, . . . ,nk are of the same order. Then 



A^(0,t2), where Kl 



2.3. Wild bootstrap for self-normalized statistic 



Recall cFiCi in (1.1). Suppose we arc interested in the self- normalized statistic 
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For problems with small sample sizes, it is natural to use bootstrap distribution instead 
of the convergence Hn N{0, t^) in Proposition 2.2. Wu [39] and Liu [30] have pioneered 
the work on the wild bootstrap for independent data with non-identical distributions. We 
shall extend their wild bootstrap procedure to the modulated stationary process (1.1). 

Let {ai} be i.i.d. random variables independent of {ci} satisfying ai G C^,E{ai) = 
0,i?(af) = 1. Define the self-normalized statistic based on the following new data: 

H* = — ^^=1 ^' where = UiCiUi and f = ^ ^" . 



Clearly, inherits the non-stationarity structure of GiCi by writing = Uie* with 
e* = eiUi. On the other hand, for the new error process {e*}, E{e*^) = E{ef) = 1 and 
Cov(e*, e*) = for iy^j. Thus, {e*} is a white noise sequence with long-run variance one. 
By Proposition 2.2, the scaled version Hn/r iV(0, 1) is robust against the dependence 
structure of {e^}, so we expect that iJ* should be close to Hn/r in distribution. 

Theorem 2.2. Let the conditions in Proposition 2.2 hold. Further assume 

Cn \ ^ / " \ 

Y.-') (E-^'j (2-11) 

Let T he a consistent estimate of t. Denote by P* the conditional law given {e^}. Then 
sup\F*{H*<x)-F{Hn/f<x)\^0, in probability. (2.12) 

Theorem 2.2 asserts that, H* behaves like the scaled version Hn/r, with the scaling 
factor f coming from the dependence of {e^}. Here we use the sample mean X in (1.1) 
to illustrate a wild bootstrap procedure to obtain the distribution of n(X — fJ.)/ {Ty_,-J in 
Proposition 2.2. 

(i) Apply the method in Section 2.5 to Xi, . . . ,X„ to obtain a consistent estimate f 
of T. 

(ii) Subtract the sample mean X from data to obtain ei = Xi — X ,i = 1, . . . ,n. 

(iii) Generate i.i.d. random variables ai, . . . ,a„ satisfying EijXi) = 0,i?(af ) = 1. 

(iv) Based on in (ii) and Ui in (iii), generate bootstrap data = Ciai, and compute 



^)^2 



where is a long-run variance estimate (see Section 2.5) for bootstrap data 
(v) Repeat (iii)-(iv) many times and use the empirical distribution of those realiza- 
tions of as the distribution of n{X — ^j)/{Ty_^). 
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The proposed wild bootstrap is an extension of that in Liu [30] for independent data 
to modulated stationary case, and it has two appealing features. First, the scaling factor 
f makes the statistic independent of the dependence structure. Second, the bootstrap 
data-generating mechanism is adaptive to unknown time-dependent variances {(jf}. For 
the distribution of in step (iii), we use P(ai = —1) = P(Q:i = 1) = 1/2, which has some 
desirable properties. For example, it preserves the magnitude and range of the data. As 
shown by Davidson and Flachaire [17], for certain hypothesis testing problems in linear 
regression models with symmetrically distributed errors, the bootstrap distribution is 
exactly equal to that of the test statistic; see Theorem 1 therein. 

For the purpose of comparison, we briefly introduce the widely used block bootstrap 
for a stationary time series {Xi} with mean jj.. By (1.2), y/n{X — /i) ^ iV(0,T^). Suppose 
that we want to bootstrap the distribution of ^/n{X — fi) . Let fc„ , ^„ , Ii , . . . , be defined 
as in Section 2.5 below. The non-overlapping block bootstrap works in the following way: 

(i) Take a simple random sample of size In with replacement from the blocks 
Ii, . . . ,Ie^, and form the bootstrap data X^, . . . ,X^,,n' = knin, by pooling to- 
gether XiS for which the index i is within those selected blocks. 

(ii) Let X'' be the sample average of X^ . . . Compute S„ = V^{X'' ~ E*{X'')}, 

where E*{X^) = Y^^=i ^i/n' is the conditional expectation of X'' given {Xi}- 

(iii) Repeat (i)-(ii) many times and use the empirical distribution of S„'s as the dis- 
tribution of \/n(X — /i). 

In step (ii), another choice is the studentized version S„ = \/n'{X^ — E* {X'')} /t'' , 
where is a consistent estimate of r based on bootstrap data. Assuming stationarity and 
fc„ — cx), the blocks arc asymptotically independent and share the same model dynamics 
as the whole data, which validates the above block bootstrap. We refer the reader to 
Lahiri [29] for detailed discussions. For a non-stationary process, block bootstrap is no 
longer valid, because individual blocks are not representative of the whole data. By 
contrast, the proposed wild bootstrap is adaptive to unknown dependence and the non- 
constant variances structure. 



2.4. Change point analysis: Self-normalized CUSUM test 

To test a change point in the mean of a process {Xi}, two popular CUSUM- type tests 
(see Section 3 of Robbins et al. [35] for a review and related references) are 

max ""''f-^^^^^' and = max f-^\Sx{j)\, (2.13) 

where is a consistent estimate of the long-run variance of {-'fi}, and 

^^^^^ = ('-i)t.^^-i E (2-14) 
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Here c > (c = 0.1 in our simulation studies) is a small number to avoid the boundary 
issue. For i.i.d. data, j(l — j/n) is proportional to the variance of Sxij), so is a 
studentizcd version of T,j. For i.i.d. Gaussian data, is equivalent to likelihood ratio 
test; sec Csorgo and Horvath [14]. Assume that, under null hypothesis, 

ln-'^/^J2l^'^-^(^^)]\ ^r{Bt}o<t<i, in the Skorohod space (2.15) 

I i=l ) 0<t<l 

for a standard Brownian motion {i?(}t>o- The above convergence requires finite- 
dimensional convergence and tightness; see Billingsley [11]. By the continuous mapping 
theorem, ^ maxc<t<i-c \Bt - tBi\/y/t{l -t) and T^/^/n^ ma.Xc<t<i-c \Bt -tBi\. 

For the modulated stationary case (1.3), (2.15) is no longer valid. Moreover, since 
and do not take into account the time-dependent variances af, an abrupt change 
in variances may lead to a false rejection of Hq when the mean remains constant. For 
example, our simulation study in Section 3.3 shows that the empirical false rejection 
probability for and is about 10% for nominal level 5%. To alleviate the issue of 
non-constant variances, we adopt the self-normalization approach as in previous sections. 
Recall Fj and y_j in (2.3). For each fixed cn < j < (1 — c)n, by Theorem 2.1 and Slutsky's 
theorem, 

Pj/^j =^ N{0,t'^) in distribution, assuming the negligibility of the approxima- 
tion errors. Therefore, the self-normalization term can remove the time-dependent 
variances. In light of this, we can simultaneously self-normalize the two terms -^i 
and (2-14) and propose the self-normalized test statistic 

tSN^ max f-'\T„{j)\, where r„(j) ^ ^^^^^ (2.16) 

^Jil-J/nrv'^+iJ/n)W'^ 

Here, is defined as in (2.3), = Etj+ii^^ -^i? ^ith = (^-.7)"' Etj+i ^- 
Theorem 2.3. Assume (2.2) holds. Let5n~^0 be as in (2.10). Under Hi^, we have 

max |r„(j)-rf„(j)| =0p((5„): 

cn<j<(l — c)n 



where 

TnU) = 



By Theorem 2.3, under Hq, T^^ is asymptotically equivalent to ina.Xcn<j<(i-c)n \ Tn{j)\- 
Due to the self- normalization, for each j , the time-dependent variances are removed and 
TnU) ^ ^(0> 1) has a standard normal distribution. However, T„(j) and T„(j') are corre- 
lated for j 7^ j'. Therefore, {T'„(j)} is a non-stationary Gaussian process with a standard 
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normal marginal density. Due to the large number of unknown parameters ctj; , it is infea- 
sible to obtain the null distribution directly. On the other hand, Theorem 2.3 establishes 
the fact that, asymptotically, the distribution of T^^ in (2.16) depends only on cti, . . . , (7„ 
and is robust against the dependence structure of {ei}, which motivates us to use the 
wild bootstrap method in Section 2.3 to find the critical value of T^^ . 

(i) Compute T„{j) and find J = argmax^„<j<(i_^)„ \Tn{j)\- 

(ii) Divide the data into two blocks Xi, . . .,Xj and Xj^-^, . . . , X„. Within each block, 
subtract the sample mean from the observations therein to obtain centered data. 
Pool all centered data together and denote them by ei, . . . , e„. 

(iii) Based on ei, . . . , e„, obtain an estimate f of t. See Section 2.5 below. 

(iv) Compute the test statistic T^^ in (2.16). 

(v) Based on in (ii), use the wild bootstrap method in Section 2.3 to generate 
synthetic data ^i, . . . and use (i)-(iv) to compute the bootstrap test statistic 

based on the bootstrap data , . . . , 

(vi) Repeat (v) many times and find (1 — a) quantilc of those T^s. 

As argued in Section 2.3, the synthetic data-generating scheme (v) inherits the time- 
varying non-stationarity structure of the original data. Also, the statistic T^^ is robust 
against the dependence structure, which justifies the proposed bootstrap method. If Hq 
is rejected, the change point is then estimated by J = argmaXj,„<j<(]^_^-|„ |r,i(j)|. 

If there is no evidence to reject Hq , we briefly discuss how to apply the same methodol- 
ogy to test Hq: a\ = ■ ■ ■ = g j ^ cr,/+i = • • • = (T„, that is, whether there is a change point 
in the variances cr|. By (1.1), we have {Xi — n)'^ ~ + (rfd, where ~ ef ~1 has mean 
zero. Therefore, testing a change point in the variances af of Xi is equivalent to testing 
a change point in the mean of the new data Xi = {Xi — XY . 

2.5. Long-run variance estimation 

To apply the results in Sections 2.2-2.4, we need a consistent estimate of the long-run 
variance t^. Most existing works deal with stationary time series through various block 
bootstrap and subsampling approaches; see Lahiri [29] and references therein. Assuming 
a near-epoch dependent mixing condition, De Jong and Davidson [18] established the 
consistency of a kernel estimator of Var(^"^-^ Xi), and their result can be used to esti- 
mate in (2.9) for the CLT of ^Jn{X — fi). However, for the change point problem in 
Section 2.4, we need an estimator of the long-run variance of the unobservable process 
{ei}, so the method in De Jong and Davidson [18] is not directly applicable. 

To attenuate the non-stationarity issue, we extend the idea in Section 2.2 to blockwise 
self- normalization. Let A:„ be the block length. Denote by In = L"-/^nJ the largest integer 
not exceeding n/fc„. Ignore the boundary and divide 1, . . . , n into in blocks 



={(.?■ - l)fc„ + 1, . . . , jfe„}, j = 1, . . . 



(2.17) 
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Recall the overall sample mean X. For each block j, define the self- normalized statistic 
= where X{j) = ^ ^ X„V'{j) = ^ [X, - (2.18) 

By Proposition 2.2, the self-normalized statistics Di, . . . , D^^ ^ N{0, r^) arc asymptoti- 
cally i.i.d. Thus, we propose estimating by 



As in (2.4)-(2.5), we define the quantities on block j 



S20)=E^f and S*^(j)=(E^f)'^'- (2-21) 
Theorem 2.4. Let (2.2) hold with An=n^/^\og{n). Recall r^.Y^n in (2.4)-(2.5). Define 



A/„ = — + max — —T h max . (2.22) 

/c„ i<j<f„ E2(j) i<j<i?„ S(j) 



Assume that r„A„/S„ — > bkc? 

Xn = C^/^ + log(r.)Af„ + VbiR^ E ^ + f E ^ 0- (2-23) 
T/ien — = Op(x„)- Consequently, t is a consistent estimate of t. 



Consider Example 2.2 with 7 e [0,1). Then Xn ^ y/\og{n)/£n + \og^{n){n^/'^ /^/k^^ + 
n^/^-^ Ikn + y/k^n^/^~^). For 7 e (3/4,1), it can be shown that the optimal rate is 
Xn ^ n~^^^\og^^^{n) when fc„ x n"^/"* log'^^^ (n) . In Example 2.3 with cr,; = i^ for some 
/3 € [0, 1), elementary but tedious calculations show that the optimal rate is 

n- 1/8 log5/4 (n) , fc„ X log^/^ („) , 
/3e [0,3/4], 

^" n('3-l)/(5-4/?){log(n)}(8(l-'3))/(5-4«^ yfc„ X n(4-5-4«/(5-4^) {log(„)}4/(5-4/i) ^ 

/3e (3/4,1). 
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2.6. Some possible extensions 

The self-normahzation approaches in Sections 2.2-2.5 can be extended to linear regression 
model (1.4) with modulated stationary time series errors. The approach in Phillips, Sun 
and Jin [33] is not applicable here due to non-stationarity. For simplicity, we consider 
the simple case that p = 2,Ui = {l,i/n), and j3 = (/3o, /3i)'. Hansen [23] studied a similar 
setting for martingale difference errors. Denote by /3q and /3i the simple linear regression 
estimates of /3o and /3i given by 

/3i^ "^jry^;:^^^'^/r^^' and Po^X-Mn + l)/{2n). (2.24) 

Then simple algebra shows that 

2 " " 6 " 

/3o - /3o ^2 _ „ X!*^^" - 3i + l)cr^ei, /3i - /3i = _ ^ y^(2z - n - l)aiei. 



i=l i=l 

The latter expressions arc linear combinations of {e^}. Thus, by the same argument in 
Proposition 2.2 and Theorem 2.1. wc have self-normalized CLTs for /3o and /3i. 

Theorem 2.5. Let ~ (2n — 3i + l)ai and s^.i ~ {2i — n — l)ai. Assume that 
{sifl}i<i<n cind {si_i}i<i<„ satisfy condition (2.10). Then as n^oo, 



2K,o 
6K,i 



N{0,t'), where V^o = y (2?! - 3^ + 1)^(X,; - - /3ii/") 



1=1 



N{0,T^), where V^^^y {2i - n - iy{X, - - Pii/nY 



1=1 



The long-run variance can be estimated using the idea of blockwise self-nor- 
malization in Section 2.5. Let kn,£n and Xj be defined as in Section 2.5. Then we propose 

= — 2^ Dj, where Dj = ^ ' (2.25) 



VE.ei,(^.-/3o-/3i*/n) 

Here, . . . , Di^ are asymptotically i.i.d. normal random variables with mean zero and 
variance r^. Consistency can be established under similar conditions as in Theorem 2.4. 

For the general linear regression model (1.4), the linearly weighted average structure 
of linear regression estimates allows us to obtain self- normalized CLTs as in Theorem 2.5 
under more complicated conditions. Also, it is possible to extend the proposed method 
to the nonparametric regression model with time- varying variances 

X, = f{i/n) + a^e,, (2.26) 

where /(•) is a nonparametric time trend of interest. Nonparametric estimates, for ex- 
ample, the Nadaraya-Watson estimate, are usually based on locally weighted observa- 
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tions. The latter feature allows us to derive similar self-normalized CLT. However, the 
change point problem for (1.4) and (2.26) will be more challenging, and Aue et al. [6] 
studied (1.4) for uncorrelated errors with constant variance. Also, it is more difficult 
to address the bandwidth selection issues; sec Altman [2] for related contribution when 
(7i = 1. It remains a direction of future research to investigate (1.4) and (2.26). 



3. Simulation study 

3.1. Selection of block length for f 

Recall that Di, . . . , Di^ in (2.25) are asymptotically i.i.d. normal random variables. To 
get a sensible choice of the block length parameter A;„, we propose a simulation- based 
method by minimizing the empirical mean squared error (MSE): 

(i) Simulate n i.i.d. standard normal random variables Zi, . . . , Z„. 

(ii) Based on Zi, . . . , Z„, obtain f with block length k. 

(iii) Repeat (i)-(ii) many times, compute empirical MSE(fc) as the average of realiza- 
tions of (f — 1)^, and find the optimal k by minimizing MSE(fc). 

We find that the optimal block length k is about 12 for n = 120, about 15 for n = 240, 
about 20 for n = 360, 600 and about 25 for n = 1200. 

3.2. Empirical coverage probabilities 

Let sample size n— 120. Recall and cr^ in (1.1). For at, consider four choices: 

Al: o-j = 0.21j<„/2 + 0.61i>„/2, 
A2: cr, = 0.2{l + cos2(i/n^/5)}, 
A3: fj, =0.2 4-0.1 log(H- |i-n/2|), 
A4: a, ^ 0.3 + 60), 

where (j) is the standard normal density, and 1 is the indicator function. The sequences 
A1-A4 exhibit different patterns, with a piecewise constancy for Al, a cosine shape for 
A2, a sharp change around time n/2 for A3 and a gradual downtrend for A4. Let Si be 
i.i.d. N(0, 1). For e^, we consider both linear and nonlinear processes. 



Bl: e, = {77,-i;(77,)}/VVar(?y,), where r/, = e\7n-i\ + - ^^e,, \0\ < 1. 
B2: Ci = aj£i_j, where a, = — ,/3 > 1/2. 



For Bl, by Wu [40], (2.8) holds. By Andcl, Nctuka and Svara [3], E{r],) = 9^/2jn and 
Va.r{r]i) = 1 — 26*^/71. To examine how the strength of dependence affects the perfor- 
mance, we consider = 0,0.4,0.8, representing independence, intermediate and strong 
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dependence, respectively. For B2 with /3 > 2, (2.2) holds with A„ = n^^^\og{n), and we 
consider three cases /3 = 2.1,3,4. To assess the effect of block length fc„, three choices 
kn = 8,10,12 arc used. Thus, wc consider all 72 combinations of {Al, A2, A3, A4} x 
{B1,6' = 0,0.4,0.8;B2,/3 = 2.1,3,4} x {fc„ = 8, 10, 12}. 

Without loss of generality we examine coverage probabilities based on 10"^ realized 
confidence intervals for yu = in (1.1). We compare our self- normalization-based confi- 
dence intervals to some stationarity-based methods. For (1.1), if we pretend that the error 
process {e^ = atCi} is stationary, then we can use (1.2) to construct an asymptotic confi- 
dence interval for fi. Under stationarity, the long-run variance of {ii} can be similarly 
estimated through the block method in Section 2.5 by using the non- normalized version 
Dj = \/^[X{j) — X] in (2.25); see Lahiri [29]. Thus, we compare two self-normalization- 
based methods and three stationarity-based alternatives: self-normalization-bascd confi- 
dence intervals through the asymptotic theory in Proposition 2.2 (SN) and the wild boot- 
strap (WB) in Section 2.3; stationarity-based confidence intervals through the asymp- 
totic theory (1.2) (ST), non-overlapping block bootstrap (BB) and studentized non- 
overlapping block bootstrap (SBB) in Section 2.3. From the results in Table 1, we sec that 
the coverage probabilities of the proposed self- normalization-based methods (columns SN 
and WB) are close to the nominal level 95% for almost all cases considered. By contrast, 
the stationarity-based methods (columns ST, BB and SBB) suffer from substantial un- 
dercoverage, especially when dependence is strong {6 = 0.8 in Table 1(a) and /? = 2.1 in 
Table 1(b)). For the two self- normalization-based methods, WB slightly outperforms SN. 

3.3. Size and power study 

In (1.3), wc use the same setting for ai and Cj as in Section 3.2. For mean /Xj, we consider 
= Ali>4o, A > 0, and compare the test statistics T^,T^ in (2.13) and T^^ in (2.16). 
First, we compare their size under the null with A = 0. The critical value of T^^ is 
obtained using the wild bootstrap in Section 2.4; for and T^, their critical values 
are based on the block bootstrap in Section 2.3. In each case, we use lO'^ bootstrap 
samples, nominal level 5%, and block length fc„ = 10, and summarize the empirical sizes 
(under the null A = 0) in Table 2 based on 10'^ realizations. While T^^ has size close to 
5%, and tend to over- reject the null, and the false rejection probabilities can be 
three times the nominal level of 5%. Next, we compare the size-adjusted power. Instead 
of using the bootstrap methods to obtain critical values, we use 95% quantiles of lO'^ 
realizations of the test statistics when data are simulated directly from the null model so 
that the empirical size is exactly 5%. Figure 1 presents the power curves for combinations 
{A1-A4} X {Bl with 9 = 0.4; B2 with (3 = 3.0} with 10^ realizations each. For Al, T^^ 
outperforms and T^; for A2-A4, there is a moderate loss of power for T,j^. Overall, 
T^^ has power comparable to other two tests. In practice, however, the null model is 
unknown, and when one turns to the bootstrap method to obtain the critical values, the 
usual CUSUM tests and will likely over-reject the null as shown in Table 2. In 
summary, with such small sample size and complicated time- varying variances structure, 
T^^ along with the wild bootstrap method delivers reasonably good power and the size 
is close to nominal level. 
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Table 1. Coverage probabilities (in percentage) for fi in (1.1) with from Bl [(a)] and B2 
[(b)]. Nominal level is 95%. SN and WB denote self-normalization-based confidence intervals 
using asymptotic theory in Proposition 2.2 and the wild bootstrap procedure, respectively; ST, 
BB, SBB denote stationarity-based confidence intervals using asymptotic theory in (1.2), non- 
overlapping block bootstrap and studentized non-overlapping block bootstrap, respectively 



e kn <Ti SN WB ST BB SBB ai SN WB ST BB SBB 



(a) Model Bl 



8 


Al 


98.0 


94.7 


93.1 


92.2 


92.8 


A2 


96.6 


95.2 


92.3 


92.5 


92.5 


10 




98.2 


95.0 


92.6 


92.4 


92.2 




94.6 


94.6 


90.0 


89.5 


89.4 


12 




98.1 


95.6 


91.7 


91.4 


91.1 




92.1 


93.7 


89.7 


89.5 


89.6 


8 


A3 


96.4 


95.0 


92.5 


92.3 


92.0 


A4 


96.6 


95.6 


93.1 


92.6 


93.0 


10 




94.7 


94.7 


90.8 


90.6 


90.6 




95.1 


95.1 


91.4 


91.3 


91.3 


12 




93.7 


94.8 


90.8 


90.4 


90.5 




92.9 


93.7 


89.8 


89.7 


89.5 


8 


Al 


98.7 


95.9 


92.7 


92.6 


92.9 


A2 


96.6 


95.3 


92.5 


92.4 


92.0 


10 




98.5 


95.7 


92.8 


92.7 


92.3 




95.4 


95.4 


91.6 


91.1 


91.6 


12 




98.0 


95.0 


90.8 


90.8 


90.2 




92.5 


94.0 


89.4 


89.1 


89.4 


8 


A3 


96.6 


95.2 


91.7 


91.7 


91.6 


A4 


95.4 


94.1 


90.8 


90.9 


90.6 


10 




95.3 


95.5 


91.5 


91.3 


91.5 




95.0 


94.8 


91.2 


90.7 


90.8 


12 




93.1 


94.6 


90.2 


89.9 


89.9 




94.1 


95.1 


90.3 


89.8 


90.1 


8 


Al 


97.9 


94.6 


87.8 


86.8 


87.3 


A2 


96.1 


94.7 


87.2 


87.3 


87.0 


10 




97.6 


95.5 


87.3 


87.0 


86.7 




93.3 


92.9 


86.4 


86.8 


86.1 


12 




97.3 


94.0 


85.8 


85.5 


85.1 




92.6 


93.4 


86.5 


86.4 


86.4 


8 


A3 


94.8 


93.5 


85.7 


85.7 


86.0 


A4 


95.5 


94.7 


86.3 


86.1 


86.1 


10 




93.5 


93.8 


85.7 


85.5 


85.2 




95.3 


95.1 


88.5 


88.3 


88.5 


12 




92.4 


93.3 


87.2 


86.7 


86.9 




92.6 


94.2 


86.3 


85.8 


85.7 



/3 



(b) Model B2 



8 


Al 


97.6 


94.9 


91.8 


91.4 


91.9 


A2 


95.9 


94.2 


91.9 


92.0 


91.1 


10 




97.7 


93.2 


88.9 


88.1 


88.3 




95.7 


95.7 


92.1 


91.8 


92.1 


12 




97.9 


95.5 


90.7 


90.2 


90.0 




93.3 


94.6 


90.0 


89.9 


89.7 


8 


A3 


94.6 


93.3 


89.8 


89.5 


89.5 


A4 


95.6 


94.7 


91.3 


91.7 


91.0 


10 




95.1 


95.2 


91.6 


91.4 


91.5 




95.4 


95.9 


92.8 


92.2 


93.0 


12 




93.8 


95.4 


90.8 


90.6 


90.2 




93.9 


94.9 


88.9 


88.5 


88.6 


8 


Al 


99.1 


95.7 


91.1 


91.0 


91.2 


A2 


95.8 


94.6 


90.4 


89.8 


90.1 


10 




98.5 


96.4 


91.6 


90.9 


91.1 




95.6 


95.2 


92.1 


91.9 


91.5 


12 




97.9 


94.6 


89.6 


89.3 


89.0 




94.1 


95.0 


90.5 


90.2 


90.4 


8 


A3 


95.9 


94.6 


92.0 


91.9 


91.7 


A4 


96.0 


94.5 


90.6 


90.4 


90.3 


10 




94.3 


94.4 


90.0 


89.9 


89.8 




94.3 


94.4 


89.2 


89.3 


88.9 


12 




93.2 


94.5 


88.9 


88.6 


88.7 




93.1 


94.1 


89.6 


88.9 


88.8 


8 


Al 


97.1 


92.5 


86.2 


86.2 


85.5 


A2 


95.7 


93.8 


88.9 


89.0 


88.7 


10 




97.6 


94.7 


89.2 


88.9 


88.6 




93.5 


93.6 


88.8 


88.8 


88.4 


12 




97.2 


95.1 


87.9 


87.5 


87.7 




92.6 


93.9 


88.0 


87.6 


87.7 


8 


A3 


94.0 


93.7 


88.5 


88.4 


88.3 


A4 


95.0 


93.1 


88.8 


88.7 


88.6 


10 




93.3 


93.8 


88.1 


87.9 


87.8 




94.1 


94.2 


89.1 


88.8 


89.1 


12 




92.9 


94.7 


89.1 


88.4 


88.4 




91.5 


92.6 


87.7 


87.5 


87.5 
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Table 2. Size (in percentage) comparison of and in (2.13) and T^^ in (2.16), with sample 
size n = 120, nominal level 5%, and block length kn ~ 10 



Model Bl Model B2 





e 


^ n 


71 




P 


^ n 


n 




Al 


0.0 


4.9 


9.1 


8.4 


2.1 


7.3 


12.2 


13.4 




0.4 


4.7 


9.4 


9.6 


3.0 


4.7 


8.6 


9.2 




0.8 


6.0 


15.1 


14.7 


4.0 


5.6 


9.9 


7.7 


A2 


0.0 


5.7 


8.2 


6.1 


2.1 


5.8 


9.5 


8.6 




0.4 


6.1 


8.9 


6.8 


3.0 


5.3 


9.6 


6.8 




0.8 


7.3 


12.6 


9.3 


4.0 


4.2 


7.5 


4.2 


A3 


0.0 


5.0 


5.7 


4.8 


2.1 


5.5 


7.7 


6.7 




0.4 


5.3 


6.9 


5.4 


3.0 


5.8 


6.1 


4.9 




0.8 


7.0 


9.8 


10.0 


4.0 


5.0 


6.5 


4.2 


A4 


0.0 


5.4 


8.4 


6.0 


2.1 


6.9 


8.8 


7.1 




0.4 


5.7 


7.9 


5.2 


3.0 


4.8 


6.6 


6.3 




0.8 


7.2 


11.1 


9.2 


4.0 


5.3 


6.2 


5.8 



Finally, we point out that the proposed self-nornialization-based methods are not ro- 
bust to models with time- varying correlation structures. For example, consider the model 
Ci = 0.3ei_i -I- Ei for 1 < I < 60 and e; = 0.8ei_i +ei for 61 < i < n, where Si are i.i.d. N(0, 
1). With fc„ = 10, the sizes (nominal level 5%) for the three tests T^^ , T^, are 0.154, 
0.196, 0.223 for Al. Future research directions include (i) developing tests for change in 
the variance or covariance structure for (1.1) (See Inclan and Tiao [26], Aue et al. [5] and 
Berkes, Gombay and Horvath [10] for related contributions); and (ii) developing methods 
that are robust to changes in correlations. 



4. Applications to two real data sets 

4.1. Annual mean precipitation in Seoul during 1771—2000 

The data set consists of annual mean precipitation rates in Seoul during 1771-2000; 
see Figure 2 for a plot. The mean levels seem to be different for the two time periods 
1771-1880 and 1881-2000. Ha and Ha [22] assumed the observations are i.i.d. under the 
null hypothesis. As shown in Figure 2, the variations change over time. Also, the auto- 
correlation function plot (not reported here) indicates strong dependence up to lag 18. 
Therefore, it is more reasonable to apply our self-normalization-based test that is tailored 
to deal with modulated stationary processes. With sample size n ~ 230, by the method 
in Section 3.1, the optimal block length is about 15. Based on 10^ bootstrap samples as 
described in Section 2.4, we obtain the corresponding p-values 0.016, 0.005, 0.045, 0.007, 
with block length fc„ = 12, 14, 16, 18, respectively. For all choices of fc„, there is compelling 




(A1,B2) (A2,B2) (A3,B2) (A4,B2) 




0.0 0.3 0.6 0.0 0.3 0.6 0.0 0.4 0.8 0.0 0.4 0.8 

lambda lambda lambda lambda 



Figure 1. Size-adjusted power curves for (dashed curve) and (dotdash curve) in (2.13) 
and T^^ (solid curve) in (2.16) as functions of change size A (horizontal axis) with sample size 
n = 120 and block length fc„ — 10. For (Al, B1)-(A4, Bl), the error process {ci} is from Bl 
with e = 0.4; for (Al, B2)"(A4, B2), the error process {e^} is from B2 with /3 = 3.0. 



evidence that a change point occurred at year 1880. While our result is consistent with 
that of Ha and Ha [22] , our modulated stationary time series framework seems to be more 
reasonable. Denote by fii and fi2 the mean levels over pre-change and post-change time 
periods 1771-1880 and 1881-2000. For the two sub-periods with sample sizes 110 and 
120, the optimal block length is about 12. With A:„ = 12, applying the wild bootstrap in 
Section 2.3 with 10^ bootstrap samples, we obtain 95% confidence intervals [121.7, 161.3] 
for fii, [100.9, 114.3] for fi2- For the difference fXi — ^2, with optimal block length A:„ = 15, 
the 95% wild bootstrap confidence interval is [19.6,48.2]. Note that the latter confidence 
interval for — /j,2 does not cover zero, which provides further evidence for /zi ^ /X2 and 
the existence of a change point at year 1880. 



4.2. Quarterly U.S. GNP growth rates during 1947-2002 



The data set consists of quarterly U.S. Gross National Product (GNP) growth rates from 
the first quarter of 1947 to the third quarter of 2002; see Section 3.8 in Shumway and 
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Figure 2. Annual mean precipitation in Seoul from 1771-2000. 

Stoffer [38] for a stationary autoregressive model approach. However, the plot in Figure 3 
suggests a non-stationary pattern: the variation becomes smaller after year 1985 whereas 
the mean level remains constant. Moreover, the stationarity test in Kwiatkowski et al. [28] 
provides fairly strong evidence for non-stationarity with a p- value of 0.088. With the block 
length k„ = 12, 14, 16, 18, wc obtain the corresponding p- values 0.853,0.922,0.903,0.782, 
and hence there is no evidence to reject the null hypothesis of a constant mean ^. Based 
on kn = 15, the 95% wild bootstrap confidence interval for ^ is [0.66%, 1.00%]. To test 
whether there is a change point in the variance, by the discussion in the last paragraph 
of Section 2.4, wc consider Xi = (X; — K.n)'^- With fc„ = 12, 14, 16, 18, the corresponding 
p-valucs arc 0.001,0.006,0.001,0.010, indicating strong evidence for a change point in 
the variance at year 1984. In summary, we conclude that there is no change point in the 
mean level, but there is a change point in the variance at year 1984. 
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Figure 3. Quarterly U.S. GNP growth rates from 1947-2002. 
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Appendix: Proofs 



Z. Zhao and X. Li 



Proof of Theorem 2.1. Let rj = \aj \ +^21=2 ~ By the triangle inequality, we 

have rj < r„. Recall Si in (2.2). By the summation by parts formula, (2.6) follows via 



'^j'^Bj + ^{(Ji - ai+i)TBi + Os..s.{r 



(A.l) 



r^a,(B, -B,_i) + Oa.s.( 



i=l 

By Kolmogorov's maximal inequality for independent random variables, for d > 0, 

^ 2n 



P< max 

l<j<n 



^(7,(5, -B,_i) 



^(T,(B,-B,_i) 



Thus, by (A.l), maxi<j<„ \Fj \ = Op(E„ + r„A,i). Observe that 

y2 _ s2 = ly, - F^/j, where W^, = ^ a^e^ - !)• 



= 5" 



(A.2) 



(A.3) 



By (2.2), the same argument in (A.l) and (A. 2) shows Wj = Op(i;*^ + r* A„), uniformly. 
The desired result then follows via (A.3). □ 

Proof of Theorem 2.2. Denote by <i>(x) the standard normal distribution function. 
By Proposition 2.2 and Slutsky's theorem, P(7J„/f < a;) — )• $(x) for each fixed x S M. 
Since $(x) is a continuous distribution, sup^jgR |P(if„/f < x) — $(x)| = 0. It remains to 
prove sup^gjj \F*{H* < x) — —?' 0, in probability. Notice that, conditioning on {e,}, 
{Ci} independent random variables with zero mean. By the Berry-Esscen bound in 
Bentkus, Bloznelis and Gotze [9], there exists a finite constant c such that 



sup nn: < x) - Hx)\ < cj2E*m')\Y,E*i\^,\') 



'3/2 



(A.4) 



where E* denotes conditional expectations given {ei}. Clearly, E*{\£^i\'^) = afe^E{a\) and 
£;(|^,|3) =crf|ef|£;(|af I). Thus, under the assumption e, e £^ we have Er=i^*(ICil^) = 
Op(Er=i'^?)- Meanwhile, by the proof of Theorem 2.1, EIli = 111=1^1^1 = 

{1 + Op(l)} Y^^=i ^1- Therefore, the desired result follows from (A.4) in view of (2.11). □ 
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Proof of Theorem 2.3. For cn <j < {1 — c)n, c < (1 — j/'n),j/n < 1 — c. For Sx{j) in 
(2.14), by (2.6), wc have maxc„<j<(i„c)« \Sx{j) ~ TSx{j)\ = Oa.s.(fnA„), where 

■ \ i . n 

Va,(i?,-i?,_i)-^ 
n I ^-^ n 



5x(j)= (l-^ )^cT,(B,-B,_i)-^ cj,{B,~B,_i). 

i=]+i 



By (2.7), max,„<,<(i_,)„ \{l-3/nfV^^ + {j/nfV^ - Vf\ = Op(tJ7„), where 

I^=^ = (l-j7n)2^a^f + (j7n)2 '^•^ n7„ ^ (rfA'. + S?J/n + + <A„. 

For cn < j < (1 - c)n, Vf > c^Yj\. Thus, condition (2.10) imphes = o(F/) and {F/ - 
Op(cc7„)}^/'^ ^ + Op{wn/Vj). Therefore, uniformly over cn < j < (1 — c)n, 

rrf, ^..N 'r5'x(i) + Oa.s.(?-nA„) T'gA-(j) ^ f r„A„ n7„S^x(j) 



By (A. 2), maxj |5x(.7)| = Op(I]„). Thus, the resuh foUows in view of Vj > cl]„. □ 

Proof of Theorem 2.4. Condition A/„ ^ imphes maxi<j<() 7'(j)A„/I](j) ^ 0. By 
(2.7), 



Define ?7j = S^^(j) X)iei ~ ^i-i)- Clearly, [/i, . . . , are independent standard 

normal random variables. Thus, maxi<j<£^ \Uj \ = Op{-\/log(f„)} = Op{^log(?i)}. By 
(2.6), X„ - /X = Op{(I]„ + r„A„)/n} = Op{Y,„ /n). Recah the definition of in (2.18). 
By the same argument in (2.6), using \/l + x = 1 + 0(a;) as x — > 0, we have 

^ ^ K{x{J)^^i} 1 fc„(M-xj 1 

^ s(j) ymj; s(j) ym^ 



^(j)A„ 



{l + 0(c.,)} + Op 



h y 



m 

= rC/,+Op{VkiRM„ + ^}. 
By the latter expression and log(ri)M„ 0, we can easily verify — = Op(xn)- O 
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